Búsqueda | Portal Regional de la BVS

IHMCIF: An Extension of the PDBx/mmCIF Data Standard for Integrative Structure Determination Methods.

Vallat, Brinda; Webb, Benjamin M; Westbrook, John D; Goddard, Thomas D; Hanke, Christian A; Graziadei, Andrea; Peisach, Ezra; Zalevsky, Arthur; Sagendorf, Jared; Tangmunarunkit, Hongsuda; Voinea, Serban; Sekharan, Monica; Yu, Jian; Bonvin, Alexander A M J J; DiMaio, Frank; Hummer, Gerhard; Meiler, Jens; Tajkhorshid, Emad; Ferrin, Thomas E; Lawson, Catherine L; Leitner, Alexander; Rappsilber, Juri; Seidel, Claus A M; Jeffries, Cy M; Burley, Stephen K; Hoch, Jeffrey C; Kurisu, Genji; Morris, Kyle; Patwardhan, Ardan; Velankar, Sameer; Schwede, Torsten; Trewhella, Jill; Kesselman, Carl; Berman, Helen M; Sali, Andrej.

J Mol Biol ; : 168546, 2024 Mar 18.

Artículo en Inglés | MEDLINE | ID: mdl-38508301

RESUMEN

IHMCIF (github.com/ihmwg/IHMCIF) is a data information framework that supports archiving and disseminating macromolecular structures determined by integrative or hybrid modeling (IHM), and making them Findable, Accessible, Interoperable, and Reusable (FAIR). IHMCIF is an extension of the Protein Data Bank Exchange/macromolecular Crystallographic Information Framework (PDBx/mmCIF) that serves as the framework for the Protein Data Bank (PDB) to archive experimentally determined atomic structures of biological macromolecules and their complexes with one another and small molecule ligands (e.g., enzyme cofactors and drugs). IHMCIF serves as the foundational data standard for the PDB-Dev prototype system, developed for archiving and disseminating integrative structures. It utilizes a flexible data representation to describe integrative structures that span multiple spatiotemporal scales and structural states with definitions for restraints from a variety of experimental methods contributing to integrative structural biology. The IHMCIF extension was created with the benefit of considerable community input and recommendations gathered by the Worldwide Protein Data Bank (wwPDB) Task Force for Integrative or Hybrid Methods (wwpdb.org/task/hybrid). Herein, we describe the development of IHMCIF to support evolving methodologies and ongoing advancements in integrative structural biology. Ultimately, IHMCIF will facilitate the unification of PDB-Dev data and tools with the PDB archive so that integrative structures can be archived and disseminated through PDB.

ModelCIF: An Extension of PDBx/mmCIF Data Representation for Computed Structure Models.

Vallat, Brinda; Tauriello, Gerardo; Bienert, Stefan; Haas, Juergen; Webb, Benjamin M; Zídek, Augustin; Zheng, Wei; Peisach, Ezra; Piehl, Dennis W; Anischanka, Ivan; Sillitoe, Ian; Tolchard, James; Varadi, Mihaly; Baker, David; Orengo, Christine; Zhang, Yang; Hoch, Jeffrey C; Kurisu, Genji; Patwardhan, Ardan; Velankar, Sameer; Burley, Stephen K; Sali, Andrej; Schwede, Torsten; Berman, Helen M; Westbrook, John D.

J Mol Biol ; 435(14): 168021, 2023 07 15.

Artículo en Inglés | MEDLINE | ID: mdl-36828268

RESUMEN

ModelCIF (github.com/ihmwg/ModelCIF) is a data information framework developed for and by computational structural biologists to enable delivery of Findable, Accessible, Interoperable, and Reusable (FAIR) data to users worldwide. ModelCIF describes the specific set of attributes and metadata associated with macromolecular structures modeled by solely computational methods and provides an extensible data representation for deposition, archiving, and public dissemination of predicted three-dimensional (3D) models of macromolecules. It is an extension of the Protein Data Bank Exchange / macromolecular Crystallographic Information Framework (PDBx/mmCIF), which is the global data standard for representing experimentally-determined 3D structures of macromolecules and associated metadata. The PDBx/mmCIF framework and its extensions (e.g., ModelCIF) are managed by the Worldwide Protein Data Bank partnership (wwPDB, wwpdb.org) in collaboration with relevant community stakeholders such as the wwPDB ModelCIF Working Group (wwpdb.org/task/modelcif). This semantically rich and extensible data framework for representing computed structure models (CSMs) accelerates the pace of scientific discovery. Herein, we describe the architecture, contents, and governance of ModelCIF, and tools and processes for maintaining and extending the data standard. Community tools and software libraries that support ModelCIF are also described.

Asunto(s)

Bases de Datos de Proteínas , Sustancias Macromoleculares/química , Conformación Proteica , Programas Informáticos

ModBase, a database of annotated comparative protein structure models and associated resources.

Pieper, Ursula; Webb, Benjamin M; Dong, Guang Qiang; Schneidman-Duhovny, Dina; Fan, Hao; Kim, Seung Joong; Khuri, Natalia; Spill, Yannick G; Weinkam, Patrick; Hammel, Michal; Tainer, John A; Nilges, Michael; Sali, Andrej.

Nucleic Acids Res ; 42(Database issue): D336-46, 2014 Jan.

Artículo en Inglés | MEDLINE | ID: mdl-24271400

RESUMEN

ModBase (http://salilab.org/modbase) is a database of annotated comparative protein structure models. The models are calculated by ModPipe, an automated modeling pipeline that relies primarily on Modeller for fold assignment, sequence-structure alignment, model building and model assessment (http://salilab.org/modeller/). ModBase currently contains almost 30 million reliable models for domains in 4.7 million unique protein sequences. ModBase allows users to compute or update comparative models on demand, through an interface to the ModWeb modeling server (http://salilab.org/modweb). ModBase models are also available through the Protein Model Portal (http://www.proteinmodelportal.org/). Recently developed associated resources include the AllosMod server for modeling ligand-induced protein dynamics (http://salilab.org/allosmod), the AllosMod-FoXS server for predicting a structural ensemble that fits an SAXS profile (http://salilab.org/allosmod-foxs), the FoXSDock server for protein-protein docking filtered by an SAXS profile (http://salilab.org/foxsdock), the SAXS Merge server for automatic merging of SAXS profiles (http://salilab.org/saxsmerge) and the Pose & Rank server for scoring protein-ligand complexes (http://salilab.org/poseandrank). In this update, we also highlight two applications of ModBase: a PSI:Biology initiative to maximize the structural coverage of the human alpha-helical transmembrane proteome and a determination of structural determinants of human immunodeficiency virus-1 protease specificity.

Asunto(s)

Bases de Datos de Proteínas , Modelos Moleculares , Homología Estructural de Proteína , Proteasa del VIH/química , Humanos , Internet , Proteínas de la Membrana/química , Anotación de Secuencia Molecular , Estructura Terciaria de Proteína , Proteoma/química , Dispersión del Ángulo Pequeño , Difracción de Rayos X

Assembly of macromolecular complexes by satisfaction of spatial restraints from electron microscopy images.

Velázquez-Muriel, Javier; Lasker, Keren; Russel, Daniel; Phillips, Jeremy; Webb, Benjamin M; Schneidman-Duhovny, Dina; Sali, Andrej.

Proc Natl Acad Sci U S A ; 109(46): 18821-6, 2012 Nov 13.

Artículo en Inglés | MEDLINE | ID: mdl-23112201

RESUMEN

To obtain a structural model of a macromolecular assembly by single-particle EM, a large number of particle images need to be collected, aligned, clustered, averaged, and finally assembled via reconstruction into a 3D density map. This process is limited by the number and quality of the particle images, the accuracy of the initial model, and the compositional and conformational heterogeneity. Here, we describe a structure determination method that avoids the reconstruction procedure. The atomic structures of the individual complex components are assembled by optimizing a match against 2D EM class-average images, an excluded volume criterion, geometric complementarity, and optional restraints from proteomics and chemical cross-linking experiments. The optimization relies on a simulated annealing Monte Carlo search and a divide-and-conquer message-passing algorithm. Using simulated and experimentally determined EM class averages for 12 and 4 protein assemblies, respectively, we show that a few class averages can indeed result in accurate models for complexes of as many as five subunits. Thus, integrative structural biology can now benefit from the relative ease with which the EM class averages are determined.

Asunto(s)

Algoritmos , Bases de Datos de Proteínas , Imagenología Tridimensional/métodos , Microscopía Electrónica/métodos , Complejos Multiproteicos/ultraestructura , Complejos Multiproteicos/química

SALIGN: a web server for alignment of multiple protein sequences and structures.

Braberg, Hannes; Webb, Benjamin M; Tjioe, Elina; Pieper, Ursula; Sali, Andrej; Madhusudhan, M S.

Bioinformatics ; 28(15): 2072-3, 2012 Aug 01.

Artículo en Inglés | MEDLINE | ID: mdl-22618536

RESUMEN

SUMMARY: Accurate alignment of protein sequences and/or structures is crucial for many biological analyses, including functional annotation of proteins, classifying protein sequences into families, and comparative protein structure modeling. Described here is a web interface to SALIGN, the versatile protein multiple sequence/structure alignment module of MODELLER. The web server automatically determines the best alignment procedure based on the inputs, while allowing the user to override default parameter values. Multiple alignments are guided by a dendrogram computed from a matrix of all pairwise alignment scores. When aligning sequences to structures, SALIGN uses structural environment information to place gaps optimally. If two multiple sequence alignments of related proteins are input to the server, a profile-profile alignment is performed. All features of the server have been previously optimized for accuracy, especially in the contexts of comparative modeling and identification of interacting protein partners. AVAILABILITY: The SALIGN web server is freely accessible to the academic community at http://salilab.org/salign. SALIGN is a module of the MODELLER software, also freely available to academic users (http://salilab.org/modeller). CONTACT: sali@salilab.org; madhusudhan@bii.a-star.edu.sg.

Asunto(s)

Secuencia de Aminoácidos , Proteínas/química , Alineación de Secuencia/métodos , Programas Informáticos , Biología Computacional/métodos , Internet , Interfaz Usuario-Computador

Macromolecular assembly structures by comparative modeling and electron microscopy.

Lasker, Keren; Velázquez-Muriel, Javier A; Webb, Benjamin M; Yang, Zheng; Ferrin, Thomas E; Sali, Andrej.

Methods Mol Biol ; 857: 331-50, 2012.

Artículo en Inglés | MEDLINE | ID: mdl-22323229

RESUMEN

Advances in electron microscopy allow for structure determination of large biological machines at increasingly higher resolutions. A key step in this process is fitting component structures into the electron microscopy-derived density map of their assembly. Comparative modeling can contribute by providing atomic models of the components, via fold assignment, sequence-structure alignment, model building, and model assessment. All four stages of comparative modeling can also benefit from consideration of the density map. In this chapter, we describe numerous types of modeling problems restrained by a density map and available protocols for finding solutions. In particular, we provide detailed instructions for density map-guided modeling using the Integrative Modeling Platform (IMP), MODELLER, and UCSF Chimera.

Asunto(s)

Sustancias Macromoleculares/química , Microscopía Electrónica/métodos , Modelos Moleculares , Proteínas/química , Secuencia de Aminoácidos , Chaperonina 60/química , Escherichia coli/química , Proteínas de Escherichia coli/química , Datos de Secuencia Molecular , Conformación Proteica , Alineación de Secuencia/métodos

ModBase, a database of annotated comparative protein structure models, and associated resources.

Pieper, Ursula; Webb, Benjamin M; Barkan, David T; Schneidman-Duhovny, Dina; Schlessinger, Avner; Braberg, Hannes; Yang, Zheng; Meng, Elaine C; Pettersen, Eric F; Huang, Conrad C; Datta, Ruchira S; Sampathkumar, Parthasarathy; Madhusudhan, Mallur S; Sjölander, Kimmen; Ferrin, Thomas E; Burley, Stephen K; Sali, Andrej.

Nucleic Acids Res ; 39(Database issue): D465-74, 2011 Jan.

Artículo en Inglés | MEDLINE | ID: mdl-21097780

RESUMEN

ModBase (http://salilab.org/modbase) is a database of annotated comparative protein structure models. The models are calculated by ModPipe, an automated modeling pipeline that relies primarily on Modeller for fold assignment, sequence-structure alignment, model building and model assessment (http://salilab.org/modeller/). ModBase currently contains 10,355,444 reliable models for domains in 2,421,920 unique protein sequences. ModBase allows users to update comparative models on demand, and request modeling of additional sequences through an interface to the ModWeb modeling server (http://salilab.org/modweb). ModBase models are available through the ModBase interface as well as the Protein Model Portal (http://www.proteinmodelportal.org/). Recently developed associated resources include the SALIGN server for multiple sequence and structure alignment (http://salilab.org/salign), the ModEval server for predicting the accuracy of protein structure models (http://salilab.org/modeval), the PCSS server for predicting which peptides bind to a given protein (http://salilab.org/pcss) and the FoXS server for calculating and fitting Small Angle X-ray Scattering profiles (http://salilab.org/foxs).

Asunto(s)

Bases de Datos de Proteínas , Modelos Moleculares , Estructura Terciaria de Proteína , Proteínas Bacterianas/química , Gráficos por Computador , Péptidos/química , Mapeo de Interacción de Proteínas , Proteínas/química , Dispersión del Ángulo Pequeño , Alineación de Secuencia , Programas Informáticos , Homología Estructural de Proteína , Interfaz Usuario-Computador , Difracción de Rayos X

Molecular docking screens using comparative models of proteins.

Fan, Hao; Irwin, John J; Webb, Benjamin M; Klebe, Gerhard; Shoichet, Brian K; Sali, Andrej.

J Chem Inf Model ; 49(11): 2512-27, 2009 Nov.

Artículo en Inglés | MEDLINE | ID: mdl-19845314

RESUMEN

Two orders of magnitude more protein sequences can be modeled by comparative modeling than have been determined by X-ray crystallography and NMR spectroscopy. Investigators have nevertheless been cautious about using comparative models for ligand discovery because of concerns about model errors. We suggest how to exploit comparative models for molecular screens, based on docking against a wide range of crystallographic structures and comparative models with known ligands. To account for the variation in the ligand-binding pocket as it binds different ligands, we calculate "consensus" enrichment by ranking each library compound by its best docking score against all available comparative models and/or modeling templates. For the majority of the targets, the consensus enrichment for multiple models was better than or comparable to that of the holo and apo X-ray structures. Even for single models, the models are significantly more enriching than the template structure if the template is paralogous and shares more than 25% sequence identity with the target.

Asunto(s)

Modelos Moleculares , Proteínas/química , Cristalografía por Rayos X , Enlace de Hidrógeno , Ligandos , Resonancia Magnética Nuclear Biomolecular , Conformación Proteica

Alignment of multiple protein structures based on sequence and structure features.

Madhusudhan, M S; Webb, Benjamin M; Marti-Renom, Marc A; Eswar, Narayanan; Sali, Andrej.

Protein Eng Des Sel ; 22(9): 569-74, 2009 Sep.

Artículo en Inglés | MEDLINE | ID: mdl-19587024

RESUMEN

Comparing the structures of proteins is crucial to gaining insight into protein evolution and function. Here, we align the sequences of multiple protein structures by a dynamic programming optimization of a scoring function that is a sum of an affine gap penalty and terms dependent on various sequence and structure features (SALIGN). The features include amino acid residue type, residue position, residue accessible surface area, residue secondary structure state and the conformation of a short segment centered on the residue. The multiple alignment is built by following the 'guide' tree constructed from the matrix of all pairwise protein alignment scores. Importantly, the method does not depend on the exact values of various parameters, such as feature weights and gap penalties, because the optimal alignment across a range of parameter values is found. Using multiple structure alignments in the HOMSTRAD database, SALIGN was benchmarked against MUSTANG for multiple alignments as well as against TM-align and CE for pairwise alignments. On the average, SALIGN produces a 15% improvement in structural overlap over HOMSTRAD and 14% over MUSTANG, and yields more equivalent structural positions than TM-align and CE in 90% and 95% of cases, respectively. The utility of accurate multiple structure alignment is illustrated by its application to comparative protein structure modeling.

Asunto(s)

Proteínas/química , Proteínas/genética , Alineación de Secuencia/métodos , Algoritmos , Secuencia de Aminoácidos , Bases de Datos de Proteínas , Conformación Proteica , Análisis de Secuencia de Proteína

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA